Crafting small databases for unit selection TTS: effects on intelligibility

نویسنده

  • H. Timothy Bunnell
چکیده

When creating unit selection voices for personal use, e.g., for use in communication aids, it is often desirable to keep the speech database as small as possible. The present study examines the effects of database size and database content on the intelligibility of synthetic speech produced by the latest version of the ModelTalker TTS system. Intelligibility here is measured objectively with an open response SU sentence task. While previous work has examined similar questions, that work has typically been with an eye toward completeness of the database coverage and using tasks that assess perceptual quality, but not explicitly intelligibility.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A hybrid TTS between unit selection and HMM-based TTS under limited data conditions

The intelligibility of HMM-based TTS can reach that of the original speech. However, HMM-based TTS is far from natural. On the contrary, unit selection TTS is the most-natural sounding TTS currently. However, its intelligibility and naturalness on segmental duration and timing are not stable. Additionally, unit selection needs to store a huge amount of data for concatenation. Recently, hybrid a...

متن کامل

Evaluation of Finnish unit selection and HMM-based speech synthesis

Unit selection and hidden Markov model (HMM) based synthesis have become the dominant techniques in text-to-speech (TTS) research. In this work, we combine HMM-based signal generation with the front end originally designed for unit selection based Finnish TTS and we evaluate the prosody of the output generated by the two synthesis techniques using the same speech database. Furthermore, we study...

متن کامل

Text To Speech for Bangla Language using Festival

In this paper, we present a Text to Speech (TTS) synthesis system for Bangla language using the opensource Festival TTS engine. Festival is a complete TTS synthesis system, with components supporting front-end processing of the input text, language modeling, and speech synthesis using its signal processing module. The Bangla TTS system proposed here, creates the voice data for festival, and add...

متن کامل

Phonetically enriched labeling in unit selection TTS synthesis

Unit selection techniques have improved the quality of textto-speech (TTS) synthesis. However, mistakes which had been less noticeable previously in poorer quality synthetic speech become very noticeable in more natural-sounding synthetic speech. Many problems appear to be caused by mismatches between phones requested by the TTS frontend and phones selected from the labeled speech inventory. Gi...

متن کامل

Text-To-Speech Intelligibility Across Speech Rates

A web-based listening test measured intelligibility across speech rate of 8 TTS systems and of a linearly timecompressed human speech reference voice. The synthesis systems included 2 independent representatives of each of the following 4 synthesis methods: formant, diphone concatenation, unit selection concatenation, and HMM. For each TTS system, a female and a male American English voice were...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010